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Abstract — Orthogonality is a much desired property for MIMO 
coding. It enables symbol-wise decoding, where the errors in 
other symbol estimates do not affect the result, thus providing 
an optimality that is worth pursuing. Another beneficial property 
is a low complexity soft decision decoder, which for orthogonal 
complex MIMO codes is known for two transmit (Tx) antennas 
i.e. for the Alamouti code. We propose novel soft decision 
decoders for the orthogonal complex MIMO codes on three and 
four Tx antennas and extend the old result of maximal ratio 
combining (MRC) to cover all orthogonal codes up to four Tx 
antennas. 

As a rule, a sophisticated transmission scheme encompasses 
forward error correction (FEC) coding, and its performance is 
measured at the FEC decoder instead of at the MIMO decoder. 
We introduce the receiver structure that delivers the MIMO 
decoder's soft decisions to the demodulator, which in turn cranks 
out the logarithm of likelihood ratio (LLR) of each bit and 
delivers them to the FEC decoder. This makes a significant 
improvement on the receiver, where a maximum likelihood (ML) 
MIMO decoder makes hard decisions at a too early stage. 
Further, the additional gain is achieved with stunningly low 
complexity. 

Index Terms — Diversity methods, maximal ratio combining, 
MIMO systems, orthogonal codes, soft decision decoding. 

I. Introduction 

MULTIPATH propagation is a key feature of wireless 
channels. It has traditionally been conceived of as 
a bottleneck that hampers the creation of efficient wireless 
communication systems. It is a longstanding problem that 
has been addressed with several relatively effective solutions 
e.g. frequency hopping in GSM and spreading of the radio 
spectrum in UMTS. One well established method involves 
using multiple antennas on the receiver (Rx) side. In 1959 
D.G. Brennan proposed an algorithm to combine the received 
constituent signals so that the signal-to-noise power ratio 
(SNR) of the compound signal becomes the sum of the 
constituent SNRs at the multiple Rx antennas. In the seminal 
paper [1], Brennan also proved the intuitive assumption that 
this is the highest achievable compound SNR in the case 
of uncorrelated additive noise. Brennan called the algorithm 
'maximal ratio combining' (MRC), and since it is optimum 
in single-in-multiple-out (SIMO) transmission systems, it can 
be considered a benchmark in comparison to more recently 
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proposed multiple-in-multiple-out (MIMO) systems. We call 
the highest attainable compound SNR the 'MRC limit', which 
is evidently an important optimality criterion in an MIMO 
receiver. 

The era of MIMO coding really took off after the publica- 
tion of the Alamouti code in 1998 Q). It is a rate 1 orthogonal 
code for two Tx antennas and - as shown by Alamouti - 
provides the same diversity gain per antenna link as MRC 
i.e. 1x2 SIMO with MRC has the same diversity order 
as 2 x 1 multiple-in-single-out (MISO) with the Alamouti 
code. The Alamouti code is a perfect code, soft decoding 
of which is simple. Moreover, it is straightforward to show 
that in the case of the Alamouti code, the combiner of the 
signals from multiple Rx antennas reaches the MRC limit. 
Since it has such beneficial properties it is easy to understand 
the popularity of the Alamouti code. In fact, it is not so easy 
to surpass the Alamouti codes performance and still preserve 
low complexity in an MIMO scheme. Thus, it can be seen as 
another benchmark. 

The Alamouti code was introduced as a space-time (ST) 
code, but we prefer using the more general term 'MIMO code' . 
This is because the codes themselves do not prevent coding 
over frequencies and the term MIMO code applies to both ST 
and space-frequency (SF) coding as well as to their combina- 
tion space-time-frequency (STF) coding. More generally, it is a 
question of diversity in terms of the number of branches from 
the Tx to the Rx side of MIMO; preferably these branches 
have a low correlation with one another. Whether the diversity 
reflects space, time or frequency is irrelevant for an MIMO 
code. 

The invention of the Alamouti code triggered a search 
for other orthogonal MIMO codes, preferably applying to 
more than two Tx antennas. An extensive study of them was 
published as early as in 1999 by Tarokh et al. f3j, who proved 
that for complex constellations, the Alamouti code is the only 
rate one orthogonal code; for three and four Tx antennas, there 
are rate 3/4 codes. Further, Tarokh et al. proved the existence 
of complex MIMO codes with a rate of 1/2 for any number of 
Tx antennas. Their simulation results presented in [4| reveal 
that the orthogonal rate 3/4 codes can outperform the Alamouti 
code despite their lower channel data rate. Hence, they clearly 
have a larger potential for diversity gain. A closer look at [4|, 
however, gives rise to some questions as well. Specifically, 
Tarokh et al. apply the maximum likelihood (ML) decoder, 
which - despite being optimum in that it cranks out the most 
likely set of transmitted constellation points - is unable to pass 
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any reliability information to the FEC code decoder. This is 
not a problem in a theoretical study that omits the FEC code, 
but in the real world, the hard decisions at the FEC decoder 
input often reduce the achievable coding gain by 2 to 2.5 dB. 
This is a significant loss of ML decoding if it takes place 
at a very early stage of the receiver, since an FEC code is 
an essential part of any wireless digital transmission system. 
There was no FEC code in |4], and the theoretical results of 
this eminent paper should be interpreted carefully when a real 
transmission system is a matter of concern. 

An exhaustive study of the orthogonal codes was published 
by Liang in 2003 @, where some methods to create high rate 
orthogonal complex MIMO codes were given. Furthermore, 
the generator matrices for the rate 3/4 codes for three and 
four Tx antennas were written on the form, which include 
only complex symbols and their conjugates. The quest for the 
maximum rate of the orthogonal complex MIMO codes for any 
number of Tx antennas was finally resolved in [5|, where it 
was proven that for any even 2m and odd 2m — 1 number 
of Tx antennas the largest possible rate is (m + 1) /2m. An 
equivalent bound was also presented by Wang et Xia (5) in 
a paper that was published in parallel with |5|. The bound in 
Wang et Xia was written lucidly as 

\N Tx /2] + 1 



r < 



(1) 



2 rA^ Tx /2l 

where iVr x is the number of transmit antennas and \x~\ stands 
for the ceiling function, i.e. the smallest integer > x. Wang 
et Xia also introduced the concept of generalised orthogonal 
codes, where the rate was upper bounded by 4/5. This bound 
is loose in order to see if competitive generalised orthogonal 
complex MIMO code designs will be found in the future. 

Orthogonality is a desired property, but it does not nec- 
essarily yield the best performance of the MIMO codes - 
especially when the measure is the error rate at the MIMO 
decoder output. Even the Alamouti code can be beaten by a 
sophisticated algebraic code, which modifies the constellation 
in a way that maximises the distances between the transmitted 
symbol sets 1171, |9|. Furthermore, there are full rate algebraic 
MIMO codes for multiple Tx antennas that guarantee great 
distances between the symbol sets, thus attaining amazingly 
low error rates at the MIMO decoder output, [ 10 1. Nonetheless, 
they suffer from two types of drawbacks, which are both 
related to the decoding. The ML decoder of an MIMO code 
finds the most likely set of transmitted symbols, but in the 
same context it is hard to extract any reliability information, 
let alone bitwise soft decisions. This is tantamount to saying 
that an MIMO code with a slightly more modest ML decoding 
performance can be highly competitive in cases where soft 
decision MIMO decoding and FEC codes performance are 
considered. Besides, ML decoding of an algebraic MIMO code 
is a toilsome operation despite the obvious relief provided by 
the sphere decoding algorithm ifTTI . The decoding complexity 
is a real problem if the algebraic MIMO codes were meant to 
be used for very high data rate transmission. 

The soft decision MIMO decoding is covered to some 
extent in existing literature. By combining the MIMO detector 
and the FEC decoder, Larsson et al. [12] strives for overall 



optimality and covers quite a lot of the theoretical aspects of 
the problem. Their mathematical model is relatively complex, 
however, which in turn explains why their numerical examples 
are limited to BPSK and (7, 3) Hamming code. The tractability 
of more powerful MIMO codes in combination with the variety 
of constellation mappings and FEC codes is still an issue, 
which lfl2ll - despite its theoretical significance - does not 
solve. 

In the current paper, we study the bitwise soft information 
and show how it can be extracted and delivered to the FEC 
decoder - with a startlingly low complexity though still in a 
nearly optimum way. As the measure of reliability, we first 
take bitwise LLR, from which we develop the vector LLR 
metric that contains the sufficient statistic for FEC decoder so 
as to be able to find the ML code word. In the process, we need 
a demodulator that directly calculates the bitwise LLR values 
instead of just the nearest constellation points and possibly 
their probabilities. It turns out that suboptimum solutions can 
be very simple, and we shed light on the basic means to attain 
the low complexity without forfeiting performance. 

The key point of the proposed approach is the MIMO 
decoder and the combining of the signals received by multiple 
antennas. It is very important for the MIMO combiner not 
to lose information and for this reason we reproduced some 
analysis of the MRC limit achieving combiners for the 1 x TVrx 
and 2 x N Rx orthogonal MIMO codes i.e. for MRC and 
the Alamouti code. Further, we show how those combining 
algorithms can be extended to the orthogonal MIMO codes 
for three and four Tx antennas as well. Hence, we introduce 
the soft decision signal combiners for the rate 3/4 orthogonal 
complex MIMO codes to reach the MRC limit, also preserving 
the low complexity structure of the rate 1 MIMO combiners. 
The novel combiners give us the tools to achieve and additional 
gain of 2 to 3 dB, which in turn renders the orthogonal MIMO 
codes for three and four Tx antennas much more competitive 
in systems with an FEC code. 

The outline of the paper is as follows. In Section II, we 
discuss our MIMO transmission model, and its mathematical 
presentation. Section III contemplates the best form for the 
soft decisions to be passed from one module to another. We 
shed light on LLR and discuss its optimality. The combining 
algorithms of MRC and the Alamouti code are presented as 
examples to demonstrate the behaviour of the noise variance 
in the process. In Section IV, we present the novel soft de- 
cision decoding and combining algorithms for the orthogonal 
complex MIMO codes for three and four Tx antennas, and 
simulation results of their performances are shown in Section 
V. Finally, in Section VI we summarise the results. 

II. The MIMO Transmission Model 

For the mathematical tractability, we require that the channel 
response between the Tx — Rx antenna pairs can be expressed 
as a complex scalar value. In the case of the impulse response, 
this means flat fading channels, which arent normally encoun- 
tered in practice. Therefore, we might prefer considering the 
frequency responses instead. The underlying system is thus 
essentially OFDM, though we do not specify any OFDM 
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G is the / x Nj x MIMO code generator matrix, 
Hi,i(m) ... H ltNsx (m) 



H 



fli,i(m + J-l) ... H hNRK (m + 1 - 1) 



is the Nj x x Ar x channel frequency response matrix that is 
assumed to be constant over the I consecutive channel usage 
instants, and 



W 



Wi(m) 



Fig. 1. The system model and soft decision flow in the receiver. 



LWi(m + Z-l) 
is the additive noise matrix. 



WN^m + l-1) 



symbol or frame structure in this context. Instead, we apply 
the idealised Rayleigh fading channel model, which does not 
take into account the variety of sub-tones and other OFDM 
specific issues. 

The model is depicted in Fig. [T] where the (OFDM) MIMO 
channel block in the middle represents the MIMO channel 
combined with the underlying OFDM processing. The soft in- 
formation that is obtained in the MIMO decoder and combiner 
is transformed into bitwise LLR in the demodulator and then 
delivered to the FEC decoder via deinterleaver. The part of 
the receiver where the signal flow consists of soft decisions 
is indicated by the soft information chain arrow, and it forms 
the essence of our MIMO receiver. 



A. Mathematical Presentation 

We refer to the MIMO symbol as the set of constellation 
points that are transmitted simultaneously from Nj x antennas 
in a single instant. The MIMO code word consists of I 
consecutive MIMO symbols. Let us consider the transmission 
of one MIMO code word, which is composed of the infor- 
mation bearing constellation points S^. It is transmitted in I 
consecutive instants from iVr x and received by Ar x antennas. 
Whether the instants are adjacent in time or frequency is 
unimportant in the mathematical modelling. What is important 
is the set of channel responses, which must remain constant 
or at nearly constant to justify the approximation (m) ss 
Hij (m + 1) « Hij (m + I + 1), where i and j stand for 
the Tx and Rx antenna indices respectively. The index m 
refers to the starting point of the MIMO code word in the 
transmission chain. The samples received by ,/Vrx antennas 
during I consecutive instants are expressed as 

R = GH + W (2) 

where the matrix of the received samples is written as 

Ri (m) . . . i?jv Rx (m) 
R= : : 

_R 1 (m + l-l) ... R Nsx (m + l-l)_ 



B. The Rayleigh Fading Channel Model 

The Rayleigh distributed complex channel coefficients were 
composed of independent Gaussian 7V(0, 1/2) distributed real 
and imaginary parts i.e. their expected energy was one. The 
links between all Tx — Rx antenna pairs were independent i.e. 
there was no correlation between the coefficients Hij(m). 

A new channel was generated at the beginning of each 
MIMO code word. In the case of 3 x and 4 x N Rx MIMO, 
this implied a constant channel during four successive MIMO 
symbols, while for the Alamouti code, the channel changed 
after every two symbols. This is as close to full interleaving 
as we can get while keeping the channel constant within 
MIMO code words. The final shuffling was done by a random 
interleaver, the length of which was several thousands bits; the 
purpose of this shuffling was to scatter the bits within a single 
MIMO word all along the FEC code word. Hence, the model 
can be described as a fully interleaved flat fading Rayleigh 
channel. 

III. The Soft Decision Flow in the Receiver 

The basic idea behind delivering the soft decisions from 
the MIMO decoder to the FEC decoder is depicted in Fig. [T] 
Essentially three components that are subjects of concern: the 
MIMO decoder, the demodulator and the FEC decoder. The 
final aim is to deliver all information to the FEC decoder that 
is necessary for it to be able to find the ML code word. The 
way to achieve this is discussed in the following subsections. 

A. The Bitwise LLR Metric 

There are basically two ways to obtain an optimum FEC 
decoder. The traditional way is ML decoding, which is able 
to find the most probable code word and then output the 
corresponding information word. The second way is to max- 
imise the a posteriori probabilities (APP) of the information 
word symbols by applying the renowned BCJR algorithm 1131 . 
Berrou et al. in the seminal paper [ 14 | very cleverly introduced 
an iterative decoding method based on the BCJR algorithm in 
which information is passed between the constituent decoders 
in the form of LLR. Berrou's invention led to amazingly good 
performance of the code, which was aptly named the 'turbo 
code'. 
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Fig. 2. QPSK constellation with Gray mapping, real and imaginary parts 
separated. 



The bitwise LLR is the logarithm in which the likelihood 
ratio of the bit is one versus zero, i.e. 



X - In ( V[Xq = X) 



(3) 



It is as optimal as the probabilities themselves - which can 
easily be solved from A g . The use of LLR directly in decoding 
instead of probabilities often leads to a simpler decoder - 
without compromising optimality e.g. it is a straightforward 
way to show that the summation of branch metric values that 
are in the form of LLR in a Viterbi decoder lead to the ML 
decoding result. 

In the general case, calculating the true LLR is not a very 
simple task. The probabilities in a multilevel modulation come 
from multiple constellation points, which make the optimum 
demodulation less attractive. However, excellent approxima- 
tions can be obtained with relative ease 1151 . lfT6l . To shed 
light on the bitwise LLR, we first consider BPSK modulated 
transmission of symbol values \fTT s and —\/E~ s over the 
additive Gaussian channel Af(0, a 2 ). With the received symbol 
R q the probability density functions (pdf) become 



p(x q = 1) 
and 

p(x q = 0) 



1 



V2i 



V2i 



-(R q +VE:) 2 /(2^) 



from which we obtain by direct calculation 



A„ 



-R a 



(4) 



(5) 



This result can be generalised to QAM constellations as 
well. Suppose we have separate bit mappings onto real and 
imaginary parts of the constellation points and the Gray 
mapping of the bit vectors is used. Then the approximation 
of LLR can be obtained by measuring dist (R q , S m a), which 
is the distance from the received point to the midpoint between 
the constellation points that give the opposite bit decisions and 
calculating 

^^dist(i? 9 ,S , mid ) , (6) 



A„ 



where d const is the distance between two constellation points, 
and all distances are measured either in real or imaginary 
dimensions. 
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Fig. 3. 16-QAM constellation with Gray mapping, real and imaginary parts 
separated. 



A simple example is the QPSK constellation depicted in 
Fig. |2j where the bitwise LLR values that are not approxima- 
tions are obtained in the case E, = 1 as 



Ai 



V2x 

rr2 ■ 



Ao 



V2 



(7) 



A lucid example of the simplicity of the multilateral bitwise 
LLR metrics is the 16-QAM constellation depicted in Fig. [3] 
where the approximate LLR values in the case E s — 1 are 
calculated as 



A 3 
Ai 



2.r 



10 a 2 ' 

2y 



10CT 2 



A 2 
Ao 



-2 



10 a 2 
2 

10~o^ 



v 



(8) 



Hence, obtaining the approximate bitwise LLR can be even 
simpler than the hard decisions. Further, the approximations 
tend to be very close to the true values - especially for 
the small X q values that are crucial to the FEC decoder's 
performance. The sub-optimality of ([8]), then, has only a minor 
impact on the bit error rates (BER) at the FEC decoder 
output. According to our simulations, it is hard to discern any 
differences between the optimum and sub-optimum 16-QAM 
demodulations. 



B. The Vector LLR Metric 

We aim to obtain the reliability metric that establishes a 
fair comparison between the code words - or code vectors, 
which are essentially the same - so that the ML decoder is 
able to pick the most probable code word. Given a set of 
binary vectors of length L and the independent bit probabilities 
p(x q ), the probability of the entire vector p(x) = n g =o P( x q)- 
Clearly f>(x) > p(y) implies that x is more probable than y. 
For the complement of x, p(x) = Jl^o 1 ~ Pi x q))> anc ^ me 
condition p(x) < p(y) holds if x is more probable. Therefore, 
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Fig. 4. The code words and code word complements presented as subsets 
of all binary vectors. 



L-l 



P(x q ) 



the probability ratio 

sorts the vectors so that the vector x is more probable than 
the vector y whenever 

p( x ) p(y) 

p(y) ' 

Using the logarithm does not change the order, since logarithm 
is a monotonically increasing function. Examining subsets 
instead of the complete set does not change the order either, 
which implies that the logarithm of the probability ratio 
sorts the words in the union of code words and code word 
complements, depicted in Fig. |4] so that most probable appears 
first. 



Therefore, maximising the metric 



M (A) 



L-l 

E 

9=0 



(9) 



among the code words [cq c\ ... cl-i] cranks out the 
ML code word. The binary code word components in 
c q € { — 1, 1} with the bit zero mapped onto — 1. 

C. The Signal Energy and the Noise Variance 

Obtaining the X q values and consequently the metric Mo (A) 
seem very simple - and it is, provided the noise variance 
(T 2 is same for every bit. However, in general it is not. The 
assumption that a 2 is same at every Rx antenna is realistic, 
but what happens when the received raw signal is equalised? It 
is assumed that the phases and amplitudes of the constellation 
points are restored before reaching the demodulator, which 
essentially divides the received samples by the channel fre- 
quency responses Hij. This implies that the additive noise 
samples also become divided by and their variance by its 
square, i.e. by H*jHij. Since a 2 resides in the denominator of 
the expression of \ q , each X q becomes multiplied by HfjHij 
as a result of the equalisation. Therefore, if the additive noise 
variance is same at all Rx antennas throughout the FEC code 
block, then a 2 in the expression of X q can be set to unity 
provided we at same time multiply the X q values by the energy 
of the channel coefficients that are related to the transmission 
of the given bit. In the MIMO case, this channel energy term 



is not just a single H*^Hij but the double sum over all ij 
combinations. 

At this stage we have lost the probabilistic interpretation of 
Mo (A), though this could be restored by finding a coefficient 
that would scale the sum of probabilities to the unity. This 
is unnecessary, however, since we are interested in the order 
of the Mo (A), which is not affected when all metrics are 
multiplied by a same constant. 

As an example, consider the transmission of a constellation 
point S as the mth symbol in the transmission chain. The 
MRC combiner of the samples obtained from Nr* antennas 
calculates the estimate 



S(m) 



N Rt 

E 



(10) 



3=1 



where the sum in the denominator is the channel energy term, 
by which the X q values are to be multiplied. 

As the second example, consider the Alamouti coded trans- 
mission from two Tx antennas 



R = G 2 H + W 
where the Alamouti code matrix 

G 2 



(11) 



m = 0, 2, 4, 



Si(m) S 2 (m) 
S%(m) S*(m)_ 

The Alamouti combiner calculates the estimates 

Rj (m)fffc (m) + J2 R *j ( m + V H V ("*) 



Si(m) 



3 = 1 



3 = 1 



5 2 (m) 



Nrx 2 
3=1 i=l 

J2 R3 {m)H* 2j (m) - J2 R *3 (™ + U ( m ) 

3=1 3^1 

Ns* 2 : 

3=1 <=1 



(12) 

where the double sum in the denominator i.e. the channel 
energy term is the same for both symbols. 
To generalise, we state that maximising 

L-l JV Rx Af Tl 

M(A) = s § n ( c ?) E E ^ (m)^(m)A 9k2=1 (13) 

q=0 j-1 i=l 

leads to the ML decision on the transmitted code word. 
Note that although the double sum energy term varies in 
the transmission chain index to, a change can only occur at 
the beginning of a new MIMO code word e.g. the energy 
term is the same for every X q , the constellation points of 
which are transmitted within the same channel usage interval 
to ... m + / — 1. 

Note that the double sum channel energy term that appears 
in the expression of M(A) has exactly the same form in 



IEEE TRANSACTIONS ON INFORMATION THEORY, IT PAPER 9-1014, SUBMITTED 9 DECEMEBER 2009 AS A REGULAR PAPER 



6 



the denominator of the Alamouti combiner, and it can be 
found in the MRC combiner as well by noticing there is 
only one Tx antenna to sum up. This coincidence might 
lead people to think that the optimum way to decode and 
combine the MIMO encoded signals involves the channel 
energy, which in fact it does - we do not propose any other 
type of MIMO combiners in this paper. Further, the energy 
term we introduced in M(A) did not come from the Alamouti 
combiner, but directly resulted from restoring the amplitude 
and phase of the transmitted symbol by dividing the received 
samples by Hij. Therefore, the optimality of M(A) is not 
restricted to the cases we present here. 

IV. The Orthogonal Rate 3/4 MIMO Codes 

Tarokh et al. called the rate 3/4 codes for three and four Tx 
antennas that they had discovered 'sporadic' Q. At that time, 
of course, they were sporadic in the sense that there were 
no rules on how to construct similar codes beyond four Tx 
antennas. Later, Liang discovered there rules 0, which state 
that e.g. for five and six Tx antennas there exist rate 2/3 codes. 
Further, the code generators of Tarokh et al. are not unique and 
the forms presented in [5 | are more convenient for decoding 
since the generator matrices contain only constellation points 
and their complex conjugates. However, we did not opt for 
either of the generators above mentioned, but selected yet 
another constructions presented by Giannakis et al. [17|. The 
generators of are essentially the same as those in with a 
few columns interchanged. The reason for using the generators 
of ifTTl is simple: having the constellation points and their 
conjugates well ordered in the generator matrix makes the 
decoding processes more straightforward, and it ultimately 
satisfies important optimality criteria. 

A. The Code for Three Tx antennas 

The generator matrix of the code ifTTl is 



Si 
-S* 
-S* 3 





S 2 

Si 


-ss 



s, 



S'l 

sz 



(14) 



where [S'l 52 5*3] are the three information bearing constella- 
tion points that are transmitted in the four consecutive instants. 
The signal received by the MIMO decoder is modelled as 



R = G 3 H + W 



(15) 



where 



R 



Ri(m) 
Ri(m + 1) 
i?i(m + 2) 
Ri(m + 3) 



Rn Rs (™) 

R Nsx (m + 2) 
R NRx (m + 3) 



G is the I x At x MIMO code generator matrix, 



H 



ff M (m) 
#3,1 (to) 



H 2: N Rl ( m ) 
H 3,N Rl ( m ) 



The dimensions and indexing of W are identical to those 
of R. On the receiver side, the minimum variance unbiased 
estimators of the transmitted constellation points are calculated 
in a fashion that resembles the Alamouti combiner. The 
equations derived in Appendix |A| are given in ( 16 1, where for 
brevity, the transmission chain index m is only marked onto 
Rj. 

The combiner of the orthogonal rate 3/4 code for three 
Tx antennas bears an obvious resemblance to both MRC and 
Alamouti combiners, and it can be considered an extension 
of them. As shown in Appendix [B] it even satisfies the 
MRC limit, thus attaining the highest achievable SNR in the 
combined estimates. 



B. The Code for Four Tx antennas 

The construction is very similar with that in Section IV-A 
The generator matrix of the code [ 17 1 is 



G :i = 



' Si 
-S* 2 

-si 





s 2 

Si 


-ss 



S3 



si 

si 





S3 
-s 2 

Si 



(17) 



where [Si S 2 S3] are the three information bearing constella- 
tion points that are transmitted in the four consecutive instants. 
The signal received by the MIMO decoder is modelled as 



R = G 4 H + W 



(18) 



wherethe R and W are the same as in the previous section 



and 



H 



H 2 ,i(m) 
H 3A (to) 
£74,! (to) 



H 2tNRi (m) 

H 3,N Rl ( m ) 
H4,N Rx (TO) 



conforming to the assumption Hij(m) . . . Hij(m + 3). 
The minimum variance unbiased estimators of the transmit- 



ted constellation points are given in (19 1, where for brevity, 



conforming to the assumption iZy(m) . . . Hij(m + 3). 



the transmission chain index m is only marked onto Rj. The 
derivation that is essentially similar to the one with three Tx 
antennas is omitted. 

Again the similar results are obtained. The combiner of the 
orthogonal rate 3/4 code - this time for four Tx antennas 
- resemblances the combiners for one, two and three Tx 
antennas and can be considered an extension of them. The 
combiner also satisfies the MRC limit, the proof of which is 
essentially similar to that where there were three Tx antennas 
is omitted. 

V. Performance Comparison of the Orthogonal 
MIMO Codes 

A. The Demodulator Output 

When studying the performance without the FEC code, the 
abscissa of the curves is the average SNR received by one 
antenna and is not to be mixed with the compound SNR. 
Examining the BER vs. SNR curves at the demodulator output 
is tantamount to studying bitwise hard decisions, a topic which 
has already been addressed in the literature. However, we 
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Si(m) 



S 2 (m) = 



5 3 (m) 



N Rl N Rl N Rx 

53 Rj (m)H^ + R ] (™ + l)#y + X ^ ( m + 2 ^ 

3=1 3=1 3=1 

Wrx 3 ! 

XX // <' // '-' 

3=1 *=1 

^ ^ {m)H* 2j - R* 3 (m + 1) #ii + X R *i ( m + 3 ) H V 

3 = 1 3=1 J=l 

JVrx 3 : 

3=1 *=1 

iVRx ATrx JVrx 

53 i?, (m)^ - ^ i?* (m + 2)ZTy - ^ R* (m + 3)H 2j 

3=1 3=2 3=2 

Wrx 3 

3=1 «=1 



(16) 



Si(m) 



Wrx 



5 2 (m) = 



S 3 (m) = 



N Rl 



ATr* 



A'Rx 



^ i?j (m)fl^ + 53 R *J ( m + !) ff 2,- + X ^ (™ + 2 ) i/3 J + X fl 3 ( m + 3 )#4. 

3=1 



3 = 1 



3=1 



! 3J ^Z^- 

3 = 1 



43 



JVrx 4 

EE^ fl y 

3=1 *=1 

ATrx Afe* ATr, Af Rx 

53 i?j (m)i^ - X R *j ( m + !)^i3 - X ^ (™ + 2)^- + X R 3 ( m + 3 )^3 

3 = 1 3=2 3 = 1 3=2 

ATrx 4 

EE ff y fl y 

3 = 1 *=1 

A^Rx ATrx A r R x ATrx 

53 i?j (m)^ + 53 ii, (m + 1)^ -53 fl* (m + 2)£T y - 53 i?* (m + 3)iJ 2j 

3 = 1 3=2 3 = 1 3=1 

Wrx 4 

EE^ ff, j 

3 = 1 «=1 



(19) 



present some curves that are essential to understanding the 
diversity gain that is attainable by concatenation of the MIMO 
and FEC codes. Proper yardsticks for comparisons are found 
in the seminal papers of Alamouti and Tarokh et al. (2), 
In Fig. [5] the comparison of MRC and the Alamouti code is 
done by extending the results in [2| to 8-PSK. The 3 dB shift 
is worth noting between MIMO schemes with an identical 
diversity order, which is visible both in our and Alamoutis 
curves. This can be intuitively explained by noting that even 
though the energy and noise terms of the MRC and Alamouti 
combiners look as if they were the same, the first Alamouti 
combiner equation removes the second transmitted symbol 
whereas the second one removes the first symbol. That is to 
say, with only one symbol energy the Alamouti combiner loses 
3 dB in comparison with MRC. 

Fig. 6 is an extension of the results in (4) to multiple Rx 
antennas. In comparing the 1x4 and 4x1 MIMO schemes, we 
see that the BER curves are parallel, and it is straightforward to 



expand the MIMO combiner equations and show that they at- 
tain the same diversity order. However, the difference between 
the curves is 6 dB, which has an obvious explanation. Namely, 
the combiner cancels two parallel symbols, in addition to 
which the MIMO code rate 3/4 reduces the efficiency. Indeed, 
101og 10 (l/3-3/4) = — 6dB, though we cannot claim that this 
result is accurate due to different constellation mappings. The 
same difference can also be seen between the 1x8 and 4x2 
MIMO curves. 

The diversity order of an Nj x x Ngx MIMO is the number of 
branches i.e. the connections from the Tx to the Rx antennas, 
which is the product N Tx N Rx . The 1 x N Rx MIMO is ideal in 
the sense that the expected received signal energy increases 
linearly in the diversity order. Adding more Tx antennas 
increases the diversity order but not the expected received 
signal energy, from which we can infer that an A^ Tx x 7V Rx 
MIMO receives l/N Tx of the energy of MRC that has the 
same diversity order. 
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Known Rayleigh channel, 
3 bits per channel use 



■ MRC/8-PSK 

■ Alamouti/8-PSK 

■ 1 Rx antenna 

- 2 Rx antennas 

- 4 Rx antennas 
• 8 Rx antennas 




Fig. 5. Demodulator output BER of MRC and the Alamouti code in 8-PSK 
constellation mapping. 
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Fig. 6. Demodulator output BER of the rate 3/4 MIMO codes in 16-QAM 
constellation mapping. 



To generalise, we state that for the combiner of any or- 
thogonal complex MIMO code, the expected received symbol 
energy in comparison with MRC can be upper bounded as 



A iVTs = 101og 10 (l/iVTx) [dB], 



(20) 



where we of course assume the same diversity orders. The 
bound was seen to be met with equality in the case of two 
and four Tx antennas, meanwhile the case of three Tx antennas 
was slightly inferior. 

To prove that the bound ( |20| ) is met with equality for 
any even N Jx and square generator matrix we note that in 
accordance with ([TJ, the maximum rate MIMO code has 
TVxx/2 + 1 complex information bearing constellation points. 



If the combiner equations exist, each must cancel the energies 
of all other Nj x /2 symbols leaving just one, from which we 
obtain the relative loss 



1 



\N Jx /2] + 1 

2 r^Tx/2] ' N Tx /2 + l 



1 



(21) 



An even Nj x and a square generator matrix form the ideal 
case, in comparison to which an odd Atv Tx and a rectangular 
generator matrix are always inferior. As an example, the 
generator of the orthogonal code for three Tx antennas is 
a rectangular 4x3 matrix, and indeed, its combined signal 
energies remain slightly below the bound. 

When the bound A jy Tx is achieved, the orthogonal complex 
MIMO codes look as if they were optimal, which they arent — 
except for the Alamouti code. With more than two Tx antennas 
the number of information bearing constellation points remain 
lower than Ajv Tx , and the energy is bounded in Ajv Tx - 

There are non-orthogonal codes that are able to outperform 
the orthogonal ones when the error rates at the MIMO decoder 
output are the yardstick. Damen et al. Q reported that even 
the Alamouti code could be outperformed by a witty algebraic 
MIMO construction, which is a significant result. However, 
applying soft decisions and measuring the performance at 
the FEC decoder output is a different matter, and in this 
case the Alamouti code has yet to be beaten. With a larger 
number of Tx antennas, it seems feasible that the orthogonal 
complex MIMO codes could be outperformed — even with the 
FEC code — though the question of the soft decision MIMO 
decoder remains to be settled. 



B. The FEC decoder Output 

The performance of the system with the FEC code is 
measured as BER versus Ei,/Nq, where Eb is the average 
information bit energy received by one antenna and iVo is the 
single sided noise spectral density. The FEC code we used in 
simulations was a simple rate 1/2, memory 6 convolutional 
code with the generators 133g and 171g. In the case of rate 
3/4 MIMO codes, the total code rate was adjusted to 1/2 
by puncturing the convolutional code with the matrix [{ J] to 
reach the rate of 2/3. The simulation results with 16-QAM 
over the fully interleaved flat Rayleigh channel are presented 
in Fig. [7] There we applied the full soft decision flow including 
the bitwise LLR demodulator as depicted in Fig. [T] 

What is surprising in our results is that the performance 
is mainly dictated by the number of Rx antennas, while 
the number of Tx antennas and the MIMO code itself have 
only a minor impact on it. The explanation can be discerned 
in Fig. [6] where the curves with the same number of Rx 
antennas are located in a same bundle at the demodulator 
output BER > 10 2 . The Alamouti code curves in Fig. [5] 
closely fall into the same bundles, while MRC is slightly 
inferior. This is the raw error rate where the convolutional 
code starts to provide substantial coding gain, and where the 
differences between the MIMO schemes are determined — or 
at least those differences we have the recourses to simulate. 
At extremely low BER, we obtain additional gain from the 
larger diversity order. 
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16-QAM over known 
Rayleigh channel, 
total code rate 1/2 



■ MRC, FEC rate 1/2 

■ Alamouti, FEC rate 1/2 

■ Tx3, FEC rate 2/3 

■ Tx4, FEC rate 2/3 

- 1 Rx antenna 

- 2 Rx antennas 

■ 4 Rx antennas 

■ 8 Rx antennas 




16-QAM over known Rayleigh channel, 
2 Rx antennas, total code rate 1/2 
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Fig. 7. Performances of the soft decision MIMO codes measured at the FEC 
decoder output. 



Fig. 8. Comparison of the hard and soft decision performances at the FEC 
decoder output. 



C. Hard Decision vs. Soft Decision 

Although the curves in Fig. [7] manifest reliable transmission 
at remarkably low E^/Nq, they do not tell exactly how much 
of a gain the soft decision MIMO coding with bitwise LLR 
demodulation has provided. Thus, we select two Rx antennas 
as a representative case to present the comparisons between 
the hard and soft decisions in Fig. [8] In the third, the energy 
scaling curve, the MIMO decoder made hard decisions, but 
the bit values were later mapped onto the set {1,-1} and 
multiplied by the channel energy. The gain from the soft 
decisions was nearly 3 dB and it was even larger in MRC. 
This is a very significant result. The energy scaling yields the 
gain of 0.5 to 1.0 dB, and the rest comes from the soft decision 
MIMO combiners and bitwise LLR demodulator. 



D. Performance in OFDM over MIMO UWB Channel 

The fully interleaved Rayleigh channel is an idealisation 
that is rarely if ever encountered in practice. To gain some 
understanding of the real value of the soft decision MIMO 
receiver and orthogonal MIMO codes, we present some sim- 
ulation results over a UWB channel in Fig. [9] The simulated 
model was OFDM and compatible with the ECMA- 368 
standard ifTSl . The FFT size was 128 and the OFDM symbols 
consisted of 100 data tones, 12 pilot tones 10 guard tones 
and 6 zero tones. Further, there were training symbols in front 
of each OFDM frame. This is the explanation for the higher 
Eb/N values in Fig. [9] compared with Fig. [8] — the UWB 
channel does not differ so much from the fully interleaved 
Rayleigh. The channel that is described in detail in lfT9l 
was CM1, with a frequency band from 3168 to 3696 MHz; 
here the correlation between the antenna links was generated 
by a simplified model reported in [20|. The channel was 



16-QAM, 

code and receiver comparison 
with 2 Rx antennas 



Alamouti code, rate 1/2 
Orthogonal Tx3, rate 9/1 f 
Orthogonal Tx4, rate 9/1 C 
sofl dec, LSE chEst 
hard dec, ideal chEst 
sofl dec, ideal chEst 




Fig. 9. Comparison of the orthogonal code performances in UWB OFDM 
transmission. 



kept constant during the OFDM frames and changed at the 
beginning of a new frame. 

We were forced to make one small amendment to ECMA- 
368 when constructing the simulation model. Viz. the 16- 
QAM mapping in the standard did not suit our bitwise LLR 
demodulator, and it was changed to the Gray mapping depicted 
in Fig. [3] 

The FEC code given in ECMA-368 is a memory 6 con- 
volutional code with the generators 133s and 145s. It has 
exactly the same weight enumerator as the FEC code used 
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in the simulations over the Rayleigh channel, which makes 
their performances equal — unless the codes are punctured. 
There was no proper rate 2/3 puncture pattern for the ECMA- 
368 code, which is why we applied the rate 3/4 puncturing in 
connection with the rate 3/4 MIMO codes. The resulting total 
rate 9/16 is close enough to 1/2 to justify nearly the same 
comparison. 

In OFDM transmission over the UWB channel, the rate 3/4 
orthogonal codes outweighed the Alamouti code by 2 to 2.5 
dB. Such a difference can be explained by noting that there 
were subtones in relatively deep fade that affected performance 
more than the randomly scattered Rayleigh fading antenna 
links. This in turn means that the gain from a larger diversity 
order becomes substantial. In Fig. [9] we also included curves 
that demonstrate the impact of channel estimation. Our com- 
biners are optimal only when the channel is known, and there 
is a risk that inaccurate estimates deteriorate the performance. 
In the current example, the deterioration had the same order 
of magnitude as the improvement from the soft decisions. 

VI. Summary 

We have extended the soft decision decoding and signal 
combining algorithms of MRC and the Alamouti code to 
the orthogonal MIMO codes for three and four Tx antennas. 
The complexities of the novel combiners do not essentially 
exceed that of the Alamouti combiner, making it very attractive 
in terms of implementation. With the soft decision MIMO 
combiners, the FEC decoders attain the same error rates with 
2 to 3 dB smaller data bit energy than with the hard decision 
MIMO. The novel combiners require a close to constant 
channel over four instants, implying that they are better suited 
for low mobility environments. 

The soft decision MIMO combiners provide the largest gain 
when linked to a bitwise soft decision demodulator. Extracting 
the bitwise LLR values from the combiner output symbols was 
shown to be a very simple task — even though a multilevel 
constellation is a matter of concern. The bitwise LLR values 
weighted by the pertaining channel tap energies encompass 
the sufficient statistic for the FEC decoder so as to be able to 
find the ML code word. 

Although the diversity order of the orthogonal MIMO codes 
for 1 to 4 Tx antennas depends only on the number of Tx - Rx 
antenna links, this doesnt mean that they perform identically. 
One Tx antenna is the ideal because 101og 10 (1/Nj x ) dB of 
received energy is lost when new Tx antennas are added and 
the number of Rx antennas is reduced so that the diversity 
order remains the same. A combiner of an orthogonal complex 
MIMO code for an even numbers of Tx antennas can be able 
to collect all available energy. Despite that the coding is not 
optimum, since the number of information bearing symbols 
remains below Nj x . 

Appendix A 

The Orthogonal Code for three Tx Antennas: 
Derivation of the Decoding and Combining Rules 

For brevity of notation, we omit the transmission chain 
index to from Si and Hij, which are not subject to change 



within MIMO code words. The encoding rule R = G3H + W 
written out into the set of equations is thus 

R 3 (m) = StHti + S 2 H 2j + S 3 H 3j + W 3 (m), 
R 3 (m + 1) = -S^Hij + S\H 2j + • H 3j + Wj(m + 1), 
R 3 (m + 2) = -S|#y + • H 2j + StH 3j + W 3 (m + 2), 
R 3 (m + 3) = • Hij - S%H 2j + S* 2 H 33 + W 3 {m + 3). 

In decoding we consider first the case with one Rx antenna. 
To obtain the estimate of Si at the j'th Rx antenna we calculate 
the auxiliary variable 

Sij = Rj{m)Hl 3 - + R*(m + l)H 2j + R*(m + 2)H 3j , 

which expands to 

£y = [Si#y + S 2 H 2j + S 3 H 3] + W 3 (m)] fl* 
+ [S 2 H^ + S X H^ + W*(m + 1)] H 2j 
+ [-S 3 #y + SiHZj + W*(m + 2)] H 3j 
= [HljHtj + H%jH 2 j + H^H 3j ] Si 
+ Wj(m)Hlj + W*(m + l)H 2j + W*(m + 2)H 3j . 

To estimate S 2 , we calculate 

S 2j = R 3 {m)H* 2j - R*(m + l)jTy + R*(m + 3)H 3j , 
and expand it to 

S 2j = [SiHi 3 + S 2 H 23 + S 3 H 33 + W 3 (m)] H* 2j 

- [-S 2 Hl 3 - + SiH* 3 + W*(m + 1)] Hij 
+ [-S 3 H* 2] + S 2 Htj + W* (to + 3)] H 3j 

= [H*jHij + H 2] H 2j + H^jH 3j ] S 2 

- W*(m + l)Hi 3 + W 3 {m)H* 2] + W*(m + 3)H 3j . 

The third auxiliary variable we choose as 

S 3j = R 3 {m)H* 3o - R*{m + 2)ify - R*(m + 3)H 2j , 
and expand it to 

S 3j = S H. , + S 2 H 2j + S 3 H 33 + Wj(m)} H* 3j 

- [-S 3 H{j + SiH* 3j + W*(m + 2)] #y 

- [S 3 H* 23 + S 2 H* 3] + W*(m + 3)] H 2j , 
= [H*jHij + H 2] H 2j + H^jH 3j ] S 3 

- W*(m + 2)Hi 3 - W 3 (m + Z)H* 2j + W*{m)H 33 . 

Consequently, the minimum variance unbiased estimators of 
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the transmitted constellation points for one Rx antenna are 

d Rj (m)H* 3 + R* (to + l)H 2j + R* (m + 2)H 3j 



>2j 



E 

i=l 

Rj (m)H* 3 - R* (to + l)H l3 + R* (m + 3)H 3j 

3 
i=l 



s 3j - 



Rj {m)Hl 3 - R* (to + 2)H l3 - R* (m + 3)H 2j 



;=i 



The extension to multiple Rx antennas is straightforward. 
The combined auxiliary variables become sums 

Su = E R i ( m ) H h + E R *i ( m + 

3=1 3=1 

+ J2 R j( m + 2 ) H 33> 
3 = 1 



JVrx 

E^m^ 


-E^(™ + 1 ) ff y 


3=1 


i=i 






+E^( m 

3=1 


+ 3)ff 3 „ 






E^H^ 


-E i? i( m+2 )- ff u 


3=1 








-E*> 


+ 3)^2,, 



which, in accordance with the one Rx antenna derivation, 
expand to 

iVRx 3 JVr* 

5i=EE HtjHijSi + E Hl.W^m) 

3=1 i=l 3 = 1 

N Rl JV Rx 

+ E ff 2iW*(m + 1) + E ^3iW*(m + 2), 

3=1 3=1 



Nr, 3 ATrx 

& = E E - E #y + 1) 

3=1 «=i 3=1 

JVrx -Nrx 

+ E ^ ^iM + E ^ (m + 3) , 

3=1 J'=l 



JVrx 3 Nr x 

& = EE - E // ' ; ir ; i >" + 2 ) 

3=1 «=1 3=1 

JVrx JV Rx 

- E HljWj (to + 3) + E H 3j W* (to), 

3=1 3=1 

Therefore, the minimum variance unbiased estimators of the 
transmitted constellation points for any number of Rx antennas 
become 



5 



Si 



lj 



E H *3 Hi - 



-, S 2 j 



S-2 



22 H *3 H ± 



S 



s. 



3j 



22 H *3 H i. 



from which substitutions of S\, §2 and S3 give the final signal 



combiner ( 16 1 presented in Section IV-A 



Appendix B 

SNR of the Combined Signal in the Orthogonal 
Code with three Tx Antennas 

For the purpose of abbreviated notation, we assume that 
expectation of the constellation point energy is unity, i.e. 
E (5*5) = 1. This is scaling that does not affect the validity 
of the derivation in the general case. The corresponding 
expectation of the noise energy in the symbol estimate S\j 
is given in eq. (22 1 , which with the noise variance a 2 further 
develops to 



E 



W 



,i=i 



J2 H *3 Hi - 



,i=i 



E H *3 H i: 



Therefore, the signal-to-noise power ratio at the jth Rx 
antenna becomes 



SNR 



Yl H i3 H k 



Combining all Rx antennas does not change the expecta- 
tion of the signal energy, which remains at unity. Provided 
the additive noise samples are independent and identically 
distributed (iid) among the Rx antennas, the expression for 
the corresponding combined noise energy becomes that of eq. 
@. 

Hence, the signal-to-noise power ratio of the combined 
estimate is 



SNR 5i 



ATrx 3 

EE // ', // " 

3 = 1 i=l 



which indeed is the sum of the constituent SNRs. The proof 
showing that the combined SNRs of the estimates 52 and S3 
are also the sums of their constituents is similar to the proof 
above. 
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E 



W 



Ml 



= E 



= E 



Wj^H* + WUm + l)H 2j + WUm + 2)H 3j 



y., i H *j H ij 

H^HijWjimyWjim) + H^H^W^m + l)*Wj(m + 1) + H^H^W^m + 2)*W J (m + 2) 



(22) 



E 



W 



= £ 



^ i^W^m) + J2 H 2o W ] {m + 1)* + £ # 3 ^(m + 2)* 



Wr, 3 

_j=l i=l 

JVrx 3 

EE^^J 

3 = 1 i=l 



Nr* 3 

EE^ 

j=l i=l 



2 JV Rx 3 

EE^ 

i=i »=i 



(23) 
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